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1.  INTRODUCTION 


This  paper  deals  with  problems  of  the  following  kind:  Suppose  we  wish 
to  purchase  a  certain  item.  We  will  be  quoted  prices  for  the  item,  one  at 
a  time,  and  at  some  point  we  must  stop  listening  and  pay  the  price  we  have 
just  heard.  When  should  we  stop  in  order  to  pay  as  little  as  we  can? 

The  primary  motivation  for  this  paper  was  the  desire  to  sharpen  and 
extend  some  recent  results  of  Stewart  (1978) ,  who  studied  certain  of  these 
problems  from  a  Bayesian  point  of  view.  In  Stewart's  model  the  prices, 
X^,X2,...,Xn  form,  conditionally,  a  random  sample  from  a  uniform  distribution 
on  an  interval  (a,  (3)  and  the  end-points  have  a  prior  density  of  the  form 

k-1  k+1 

Pk(a,  (3 1  £,u)  =  k(k-l)  (u-£)  /  (8-a)  ,  -  <»  <  a  <  £  <  u  <  B  <  °°;  k  >  1.  (1.1) 

These  densities  were  chosen  for  mathematical  convenience.  They  are  conjugate; 
i.e.,  the  posterior  density,  say  f  (a,  8 1 X^ , .  .  .  ,X^.  )  ,  is  of  the  same  form  as 
the  prior  density,  namely 

f (a,3|xi5 . . . ,Xj)  =  Pk+j (a,B|£j ,u.)  (1.2) 

where 

jL  =  min(£,X^, . .  .  ,X_.  )  (1.3) 

and 

u^.  =  max(u,X^  , . .  .  ,Xj  )  .  (1.4) 

In  other  words  the  parameters  £  and  u  of  the  prior  density  (1.1) 
play  the  role  of  the  smallest  and  largest  of  k  previous  "pseudo-prices". 

(We  shall  regard  k  as  an  integer  in  (1.1)  although  it  need  not  be.)  This 
role  is  also  reflected  in  the  form  of  the  Bayes  rules  which  Stewart  derives 
when  the  loss  for  accepting  the  i-th  price  is  either  of  the  following: 
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(a)  the  Quantile  of  the  price:  (X  -a) /(8-a) 
or 

(b)  zero  or  one  according  as  is  or  is  not  the  "Best  Choice" 

which  is  defined  to  be  min{£,X^, . . . ,Xn}. 

(The names  Quantile  Problem  or  Best  Choice  Problem  will  be  used  whenever  we 
are  dealing  with  these  losses,  whether  or  not  there  is  a  pseudo-observation, 

M 

In  each  problem  the  Bayes  rule  —  which  is  the  policy  (i.e.,  stopping 
rule  based  on  the  prices)  that  minimizes  the  Bayes  risk  —  essentially  depends 
on  the  sample  size  n  and  the  prior  density  parameter  k  only  through  their 
sum,  say  m  =  n  +  k.  It  features  a  "learning  period"  of  the  form 
max{i(m) ,k+l}  -  k  which  is  the  subscript  of  the  earliest  price  which  can 
possibly  be  accepted.  Beginning  with  that  price,  acceptance  is  based  solely 
on  the  values  of  the  current  price,  X^,  of  jL  and  u^,  and  of  the  number 
of  prices  remaining  —  up  to  a  maximum  of  m-2  since  the  smallest  integer 
k  for  which  (1.1)  is  a  probability  density  is  k=2.  (Exact  expressions 
for  these  Bayes  rules  are  given  at  the  beginning  of  Section  4.)  Further¬ 
more,  while  the  risk  does  decrease  with  increasing  n  (or  decreasing  k) 
until  k  +  1  £  i(m),  it  then  remains  constant  for  all  larger  n  (or  smaller 
k)  . 

If  i(m)  is  at  least  3,  then  setting  k=0  in  the  above  policy  and 
eliminating  the  pseudo-prices  £  and  u  from  (1.3)  and  (1.4)  yields  a 
perfectly  legitimate  stopping  rule.  In  Section  4  we  shall  exploit  the  fact 
that  the  losses  (a)  and  (b)  are  invar iant  under  location  and  scale  trans¬ 
formations  to  show  that  this  policy  is  minimax  for  the  family  of  all  uniform 


distributions  among  all  stopping  rules  based  on  m  prices  and  that  its  risk 
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is  a  constant  equal  to  the  Bayes  risk  of  the  corresponding  Bayes  rule 
with  k=2.  Since  Stewart's  results  shows  that  i(m)  ^  3  for  m  ^  6  in 
the  Quantile  Problem  and  for  m  ^  5  in  the  Best  Choice  Problem,  this  very 
nearly  settles  the  minimax  problem.  And  the  minimax  policy  for  the  few 
remaining  values  of  m  will  also  be  given  in  Section  4.  The  results  of 
Section  4  utilize  a  general  framework  developed  in  Section  3  which  applies 
to  all  bounded  losses  which  share  the  invariance  property  of  the  Best  Choice 
and  Quantile  Problem  losses.  Section  3,  we  think,  constitutes  an  especially 
interesting  "concrete"  application  of  the  invariance  ideas  commonly  referred 
to  as  "Hunt-Stein  Theorems". 

Stewart's  goal  was  to  find  optimal  risks  as  well  as  optimal  rules  in 
order  to  assess  the  penalties  which  must  be  incurred  for  not  knowing  the  actual 
distribution  of  the  prices,  as  well  as  the  rewards  which  may  be  received  for 
being  able  to  actually  "measure"  the  prices  rather  than  to  know  only  the 
relative  rank  of  each  current  price  among  those  heard  so  far.  (The  relative 
rank  of  X^  among  the  first  i  prices  is  defined  to  be  j  if  X^  is  j-th 
smallest  among  *  *  * ’^i"  And  a  relative  rank  rule,  when  there  are  no 

pseudo-prices,  is  a  policy  in  which  the  decision  of  whether  or  not  to  accept 
a  given  price  is  based  solely  on  how  many  prices  have  been  heard  so  far  and 
what  is  the  relative  rank  of  the  current  price  among  them.)  By  considering 
only  uniformly  distributed  prices  Stewart  could  invoke  results  of  Gilbert 
and  Mosteller  (1966)  and  Chow  et  al.  (1964)  which  give  the  optimal  rules 
and  risks  in  the  two  extreme  cases.  These  are  reviewed  in  Section  2. 

Stewart  showed  that  in  the  Best  Choice  problem  there  is  no  reward  at 
all  for  being  able  to  observe  the  X^'s  themselves  rather  than  just  their 
relative  ranks.  Indeed  in  the  Bayes  Rule  the  decision  of  whether  or  not 
to  accept  the  i-th  price  is  based  solely  on  i  and  on  whether  or  not 
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X^  =  min{5-,X^ ,X^ , .  .  •  ,X_^} .  In  the  Quantile  Problem,  on  the  other  hand, 
there  is  such  a  reward.  Stewart's  numerical  results,  together  with  the 
previously  known  limits  reviewed  in  Section  2,  suggested  that  for  large 
n  the  risks  of  the  best  relative  rank  rule,  the  Bayes  rule,  and  the  optimal 
rule  when  the  distribution  is  known,  are  in  approximately  the  ratio  3.87  to 
3.48  to  2.  So,  of  the  difference  between  the  two  extreme  values,  about  79% 
is  due  to  uncertainty  about  which  uniform  distribution  is  being  sampled  from 
and  21%  to  suppression  of  information  when  we  use  only  relative  ranks  rather 
than  the  prices  themselves. 

That  exactly  the  same  results  hold  for  minimax  risks  is  virtually  im¬ 
plied  by  the  previously  remarked  intimate  relationship  between  the  Bayes 
rules  and  the  minimax  rules.  (The  one  small  loose  end  is  verifying  that 
the  minimax  policy  in  the  Best  Choice  Problem  is  a  relative  rank  rule  for 
m  £  5  as  well  as  for  m  >  5.) 

For  the  Quantile  Problem,  Theorem  5.1  confirms  the  limit  for  the  risks 
suggested  by  Stewart's  numerical  results.  Further  insight  is  provided  by 
the  monotonicity  results  of  Section  6.  And  the  asymptotic  performance  of 
an  appealing  sequence  of  simplified  policies  is  also  evaluated  in  Section  5. 
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2.  UPPER  AND  LOWER  BOUNDS  FOR  RISKS 

2.1.  Best  Policies  When  the  Distribution  is  Known 

Solutions  to  both  the  Quantile  Problem  and  the  Best  Choice  Problem 
when  the  end-points  a  and  0  are  known  can  be  found  in  Gilbert  and 
Mosteller  (1966).  They  are  as  follows: 

For  the  Quantile  Problem,  when  the  sample  size  is  n  the  optimal  policy 
is  to  stop  as  soon  as 

(X^-cO/CB-a)  £  vn_±  .  (2.1) 

Here  denotes  the  i-th  price  we  will  hear  and  the  v^'s  are  a  single 

decreasing  sequence  of  constants  (the  same  for  all  sample  sizes,  n )  which  are 
computed  recursively  from  the  formula 

vo"1:  vt+i  ■  V1- V2)  •  (2-2> 

The  v,  's  play  a  dual  role  in  this  problem:  v  is  also  the  risk  when  the 
k  n 

sample  size  is  n  and  the  optimal  policy  is  used.  And,  as  was  shown  by 
Gilbert  and  Mosteller, 


nv  2  as  n  -*■  co  .  (2.3) 

n 

Thus  if  n  is  large  the  expected  price  to  be  paid  —  expressed  as  a  quantile 
of  the  underlying  price  distribution,  is  about  2/n,  which  is  just  about  twice 
what  we  would  expect  to  pay  if  we  could  hear  all  n  prices  and  then  choose 
the  smallest  one. 

For  the  Best  Choice  Problem,  when  the  sample  size  is  n  the  optimal 
policy  is  to  stop  as  soon  as  X^  =  min{X^, . . . ,X^}  and 
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(X.-a)/(8-a)  £  d  . 

i  n-i 


(2.4) 


where  d„  =  1  and,  for  k  5  1,  d,  satisfies 
0  k 


d-d,  )k  =  i  r 1  (k)  d-d.  >k-jdi 


(2.5) 


j=l 


The  d^'s  are  decreasing  in  k,  and  writing 


dk  =  ck/(k+Ck}  ’ 


(2.6) 


it  follows  from  (2.5)  and  the  dominated  convergence  theorem  that 


c^  -*■  c  ~  .804352 


(2.7) 


where  c  is  the  solution  to 

OO 

l  cj/j.'j  =  1  .  (2.8) 

3=1 

The  probability  of  correctly  choosing  the  lowest  price  with  the  optimal  policy 
was  evaluated  by  Gilbert  and  Mosteller  for  various  sample  sizes.  This  optimal 
probability  of  best  choice  is  a  decreasing  function  of  sample  size,  with  limit¬ 
ing  value 

-  00  _i  _ 

e  c  -  (ec-c-l)  /  x  e  CXdx  s  .580164  (2.9) 

1 

with  c  as  in  (2.8).  The  value  .580164  was  obtained  by  numerical  methods  by 
Gilbert  and  Mosteller.  The  actual  algebraic  expression  for  the  limit  is 
shown  in  Appendix  A  of  this  paper  to  be  equal  to 
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P (Z (1-T)  <  c  <  (Z+  Z'/T) (1-TT’))  (2.10) 

where  Z,  Z',  T,  and  T'  are  mutually  independent  random  variables  with  Z 
and  Z'  each  exponentially  distributed  with  parameter  one,  and  T  and  T' 
each  uniformly  distributed  on  the  interval  (0,1).  A  proof  that  (2.10)  is 
indeed  the  limiting  optimal  probability  of  best  choice  is  sketched  in  Appendix 

A. 

(These  results  for  the  Best  Choice  Problem  apply  not  only  to  uniform 
distributions  but  also  to  any  known  continuous  distribution  F,  with  (2.4) 
replaced  by  "F(X1)  £  d  ./'.) 

2 . 2  Best  Relative  Rank  Rules 

A  familiar  linearity  property  of  uniform  distributions  —  that  the  mean 
of  the  i-th  order  statistic  of  a  sample  of  size  n  from  a  uniform  distribution 
on  (a, 3)  is  a+  i(3~o0/ (n+1)  —  yields  the  solution  to  the  Quantile  Problem 
in  the  restricted  class  of  relative  rank  rules  directly  from  the  result  of 
Chow  et  al.  (1964).  This  linearity  property  implies  that  for  any  relative 
rank  rule  the  expected  quantile  of  the  price  accepted  by  the  rule  is  just 
(n+1)  1  times  the  expected  rank  of  that  price.  Chow  et  al.  in  effect  found 
that  for  any  continuous  distribution  of  prices  the  minimal  expected  rank 
attainable  by  a  relative  rank  rule,  say  Cn>  satisfies 

co  l/(j+l) 

C  f  l  (1 +  2 j  )  ~  3.8695. 

n=l 

and  is  attained  by  a  single  stopping  rule  for  each  n,  regardless  of  the  dis¬ 
tribution.  Hence,  for  uniform  distributions  this  same  policy  minimizes  the 
expected  quantile,  and  the  minima,  say  r^,  satisfy 
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nrn  =  nC  /(n+1)  f  ~  3.8695  .  (2.11) 

The  Best  Choice  Problem  with  only  relative  rank  rules  permitted  is  an 
elementary  one  found  in  many  places  including  Gilbert  and  Mosteller  (1966) . 

For  any  continuous  distribution  of  prices  the  optimal  policy  when  there  are 
n  _  available  prices  is  to  stop  as  soon  as  is  the  lowest  price  heard  so 

far  and  i  ^  i(n)  where 

n-l  1  n-1 

l  x  <  1  £  l  l  .  (2.12) 

i=i(n)  i=i(n)-l 

This  policy  has  probability  p^  of  selecting  the  lowest  of  all  n  prices  where 

n_1  _i 

P  =  T  i(n)/ni  V  e  ^  .3679  .  (2.13) 

n  i=i(n)-l  * 

2 . 3  Other  Best  Choice  Problems 

Recently  Petruccelli  (1979)  has  studied  the  Best  Choice  Problem  where  the 
price  distribution  belongs  to  the  restricted  family  of  just  those  uniform  dis¬ 
tributions  with  3-01*1,  and  found  that  the  best  relative  rank  rule  is  not 
minimax,  in  contrast  to  the  result  for  the  full  class  of  all  uniform  dis¬ 
tributions.  By  adapting  the  method  used  to  get  (2.10)  Petruccelli  found  that 
the  asymptotic  minimax  probability  of  best  choice  is  ~  .43517. 

Petruccelli  also  gives  sufficient  conditions  under  which  the  asymptotic 
minimax  best  choice  probability  achieves  the  value  in  (2.9),  an  example  of 
which  is  the  family  of  all  normal  distributions. 
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3.  INVARIANCE  AND  MINIMAXITY 

3.1.  Invariance  of  the  Quantile  and  Best  Choice  Problems 

A  property  shared  by  both  the  Quantile  Problem  and  the  Best  Choice  Problem 

is  that  their  loss  functions  are  invariant  under  location  and  scale  transforma- 

/  \ 

tions.  What  this  means  is  the  following:  Let  W^  ' (x;a>y)  denote  the  loss 
for  accepting  the  i-th  of  the  n  available  prices  when 

x  —  (x^  > • • • 

is  the  vector  of  prices  and  a  and  a  +  y  are  the  end-points  of  the  underlying 
uniform  distribution.  We  call  the  loss  invariant  if  for  each  i  =  l,2,...,n 

wjn)(x;a,y)  E  w^n)((x-a)/y)  (3.1) 


where  we  use  the  abbreviations 


and 


(x-a)/y  E  ( (x1~a) /y, ...» (x^-a) /y) 

a  +  bx  E  (a+bx, , . . . , a+bx  )  . 

~  1  n 


In  the  Quantile  Problem 

W^(x;a,y)  =  (Xj-oO/y  , 


(3.2) 


while  in  the  Best  Choice  Problem 


W^n)(x;a,y)  =  1 
=  0 


if  x.  =  min-fx..  , . . .  ,x  } 
i  L  1  n 

otherwise. 


or,  equivalently 
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W^(x;a,Y)  =  1 

=  0 


if  (xi-a)/y  =  min1 <n { (x ^ -a) /y } 
otherwise. 


(3.3) 


Both  of  these  loss  functions  are  clearly  invariant.  (In  addition,  they  are 
bounded  by  one  and,  for  i  ^  3,  symmetric  in  x^  and  ,  properties  which 
will  be  needed  later.) 

When  the  loss  is  invariant,  the  risk  of  a  stopping  rule  T,  say  p^(a,Y) > 
becomes 


PT(a,Y)  "  Ea>yw^n)  ((X-oi)/y) 

a+Y  a+Y  n  ,  .  n 

=  /  •••  /  l  wi  ((x-a)/Y)I/  (  )=i}Y_n  Tf  dx 

a  a  i=l  i=l  1 


n 


E0,lwi  ^  (?):E{T(a+YX)=i} 


(3.4) 


The  last  equality  follows  from  a  change  of  variables  in  the  integral. 

(Note  that  although  we  write  X  =  T(x^,...,x  ),  the  fact  that  T  is  a 
stopping  rule  means  that  (x:x(x)  =  i}  depends  only  on  the  first  i  coordinate 
of  x. ) 

3.2.  Minimaxity  of  Best  Invariant  Rules 

Whenever  the  loss  function  is  invariant  under  some  group  of  transforma¬ 
tions,  it  is  natural  to  look  for  a  minimax  stopping  rule  within  the  class  of 
invar iant  rules.  In  this  case  an  invariant  stopping  rule  x  is  one  which 
satisfies 

x(a  +  bx)  =  x(x)  (3.5) 

for  all  real  x^,...,x  and  a  and  all  positive  b.  For  example,  any 
policy  of  the  following  form  is  invariant: 
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T (X.. , . . . ,X  )  =  min{j  5 i  :  (X. -L. ) / (U ,-L . )  £  c . } 
1  n  n  11  13  3 


(3.6) 


where  i  is  some  integer  with  2  £  i  £  n. 


L_^  =  min{X^, . . .  ,X  }  , 


(3.7) 


U .  =  max{X,  , . .  . ,  X . }  , 
l  1  i 


(3.8) 


and  the  c.'s  are  some  constants  with  0  £  c.  <1  for  i  £  i  <  n,  and 
l  in 


c  =  1. 
n 


Also  invariant  are  all  relative  rank  rules,  such  as 


x(X1,...,Xn)  min(i^in:  £  1{x.  £X.}^ki^ 

3=1  3  i 


(3.9) 


where  the  k/s  are  some  positive  integers  with  kR  =  n. 

By  (3.4)  and  (3.5),  the  risk  of  any  invariant  rule  is  constant,  and  can 
be  written  as 


1  n 


Px  f  I  I  wi  ^^1{x(z)  =  i}  "fT  dzi 

0  0  i=l  S  i=l 


n 


(3.10) 


The  following  Proposition  can  be  obtained  from  a  theorem  of  Kiefer  (1957) 


Proposition  3.1.  Let  X^,X2,...,Xn  be  IID,  each  uniformly  distributed 

on  the  interval  (a,a+y)  where  a  is  real  and  Y  is  positive.  For  each 

i  =  2,3,...,n,  and  0  ^  z.  £  1,  let  the  loss  function  wfn^ (z,  , . . . , z  )  be 

l  l  1  n 

bounded  and  non-negative.  Let  T  be  a  stopping  rule  based  on  the  X^'s 
with  T  >  2.  Finally,  choose  £  >  0.  Then  there  is  an  invariant  rule  x' 


with  risk 
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p  .  s?  sup  p  (a,y)  +  e- 

T  -oo<a<oo,y>0  x 

This  Proposition,  proved  in  Appendix  B,  immediately  yields 

Theorem  3.1.  Under  the  conditions  of  Proposition  3.1.,  if  there  is  a 

best  invariant  rule  in  the  class  of  all  stopping  rules  based  on  X,,X_,...,X 

J-  l  n 

which  always  take  at  least  two  observations,  then  it  is  minimax. 

Following  Kiefer  (1966)  we  should  re-phrase  Proposition  3.1  as:  "For 
stopping  rules  which  take  at  least  two  observations,  the  Hunt-Stein  property 
holds."  This  is  in  recognition  of  the  celebrated  unpublished  theorem  of  Hunt 
and  Stein  in  the  early  1940 's  which  gave  conditions  under  which  a  best  in¬ 
variant  procedure  in  a  hypothesis  testing  problem  must  necessarily  be  minimax. 

3.3.  Pseudo- Invariant  Rules  and  Their  Bayes  Risks 

Theorem  3.1  begs  the  question:  Is  there  a  best  invariant  rule  and  if 
so  how  do  we  find  it?  Within  the  restricted  class  of  stopping  rules  which 
always  take  at  least  three  observations  this  question  turns  out  to  have  an 
easy  answer. 

Let  Z  and  u  be  any  numbers  with  Z  <  u.  If  T  is  an  invariant  rule 
with  3  £  T  £  n,  then  each  of  the  following  stopping  rules, 

T1(x1,...,xn_2)  =  T(£,u,x1,...,xn_2)  -  2, 

and 

T2(Xl,...,xn_2)  =  T(u,£,x1,...,xn_2)  -  2, 

will  be  called  a  pseudo-invariant  rule.  The  name  is  inspired  by  the  fact 
that  both  and  differ  from  the  invariant  T  only  in  that  they  treat 

the  constants  Z  and  u  as  though  these  were  the  first  two  observations; 
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hence  x^  and  T  are  based  on  only  n-2  rather  than  n  actual  observa¬ 
tions.  (Later,  when  we  restrict  attention  to  t's  which  are  symmetric  in 
the  first  two  variables,  there  will  be  just  one  pseudo-invariant  x  corre¬ 
sponding  to  X.) 

We  now  consider  the  Bayesian  problem  with  n-2  available  observations 
in  which  a  and  8,  the  end-points  of  the  underlying  uniform  distribution, 
are  given  the  particular  a_  priori  density 

7T(a,B|£,u)  =  2(u-£)/(3-a)3,  -»<  a<£<u<3<co  (3.11) 


(this  is  (1.1)  with  k=  2)  and  the  loss  for  accepting  the  i-th  price  is  taken 

to  be  either  Wi+2^»u,Xi»  *  ‘  *  ,Xn-2’a’  °r  Wi+2  ^u’  ^»X1»  *  *  *  ,Xn-2;a’  * 

(The  ambiguity  here  is  only  temporary.)  We  use  the  notation  E  to  denote 

expectations  taken  with  respect  to  the  density  (3.11).  This  particular 
Bayesian  problem  provides  the  key  to  the  problem  of  finding  best  invariant 
rules  because  the  constant  risk  of  any  invariant  x  >  3  happens  to  be  just 
the  average  of  the  Bayes  risks  of  the  corresponding  pseudo-invariant  x^  and 
Using  the  abbreviation 


<  £,u,X) 


/  £-q  u-q  X1  a 

^  8-a  ’  8-a  »  8-a  ’ 


8-q 


) 


we  have 


Proposition  3.2.  Under  the  same  conditions  as  in  Proposition  3.1.,  with 
8  =  a  +  y,  if  x  is  invariant  and  x  ^  3,  then  for  every  £  and  u  with 
-°°<£<u<°°, 
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PT  2 


L%uw*r  (i,u,X1, . . .  ,Xn_2) (<  £,U’P) 


+  E  w<»>  Y  «u,£,X>)~] 

£,u  t(u,£,X1,  . . .  ,Xn_2)  ~  J 


(3.12) 


Proof :  Letting  z'  =  (z^,...yz^)  we  first  re-write  (3.10)  as 


n 


2  1  1 

p x-  x  /  ...  /  / 

j-1  0  00 


dz. 


n 

l 

i=3 


(n)  . 

w;  (z .  , 
i  J 


n 


Z3-j’~’)I{x(5 


3-j 


')  =  i> 


TT  dz . . (3 . 13) 


i=2 


Next  we  exploit  the  invariance  of  x  to  write 


£,u  WT(£,u,X. 


.(n) 


1* 


•’Xn-2) 


«  £,u,X» 


-I  I  i«^.x»i{  « 

u  -oo  a  a  1=3 


n+1 


n 

]  (  Tf  dx  )  dad 3  (3.14) 
i=3 


and  make  the  change  of  variables 


z1  =  (£-a)/(3-a);  z2  =  (u-a)/(3-a) 


z^  =  (x^-a)  /  (f3-a) ,  i  =  3 , . . .  ,n  . 

n_|_i 

The  Jacobian  of  this  transformation  is  (B-oO  /  (u-£)  so  the  right  side  of 
(3.14)  becomes  simply  twice  the  j  =1  term  of  the  right  side  of  (3.13).  The 
j  =  2  term  is  derived  similarly.  n 


Suppose  now  that  none  of  the  losses 


i  £  3,  depends  on  the  order 


of  the  first  two  observations.  This  eliminates  the  ambiguity  in  the  definition 
of  the  loss  function  for  the  Bayesian  problems  and  simplifies  (3.12)  for  in¬ 
variant  rules  which  do  not  depend  on  the  order  of  the  first  two  observations. 


17 


It  also  enables  us  to  ignore  all  other  invariant  rules  as  the  following 
proposition  shows.  (The  proposition  is  an  easy  consequence  of  (3.10)  so 
the  proof  is  omitted.) 

Proposition  3.3.  If  for  each  i  ^  3,  w^n^ . . .)  =  w^n^ . . .) , 
then,  for  any  invariant  x  ^  3,  PT  ^  min(p^_ , , p^M)  where  x*  and  x"  are 
the  invariant  stopping  rules  defined  by 

X?  (z1,z2> . . .)  =  x(max(zi,z2) ,min(z  ,z  ) , . . .) 

x"(z1,z2, . . .)  =  x(min(z1,z  ) ,max(z  ,z  )  , . .  .)  . 

Propositions  3.2  and  3.3  immediately  yield: 

Theorem  3.2.  Under  the  conditions  of  Propositions  3.1  and  3.3  a  suffi¬ 
cient  condition  for  an  invariant  stopping  rule  x*  £  3  to  be  best  among  all 
invariant  rules  based  on  X^,X2,...,Xn  which  always  take  at  least  three 
observations  is  that  x*  be  symmetric  in  the  first  two  variables  and,  for 

each  Z  and  u,  the  pseudo-invariant  rule  x  defined  by  x(x, ,...,x  „)  = 

1  n-z 

X*(&»u,x^, . . . ,x  2)  -  2  be  a  Bayes  rule  when  there  are  n-2  available  observa¬ 
tions  and  the  a  priori  density  is  given  by  (3.11). 

(The  phrase  "...for  each  Z  and  u  ..."  in  the  theorem  can  be  weakened 
to  "...for  some  Jl  and  u  ..."  but  the  stronger  version  suffices  for  the 
applications  in  the  next  section  and  is  more  relevant  to  the  comments  which 
follow. ) 

To  put  Theorem  3.2  in  proper  perspective,  we  need  to  introduce  the 
"density"  TT^(a, B)  =  (g-a)  \  <  a  <  3  <  °°.  This  is,  of  course,  the 

density  of  an  infinite  measure.  That  measure  is  the  so-called  right  in¬ 
variant  measure  on  the  parameter  space  which  is  introduced  in  Appendix  B 
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where  it  plays  a  key  role  in  the  proof  of  Prosition  3.1.  If  we  ignore 
the  fact  that  it  is  an  "improper  prior"  (because  it  is  infinite),  imagine 
that  are,  conditionally,  IID  uniform  on  (a,B)  with  (a, 3) 

having  prior  "density"  TT^,  and  compute  formally  the  posterior  density  of 
a  and  3  given  X^  and  X^»  we  get  precisely  (3.11)  with  l  and  u 
replaced  by  min(X^,X2)  and  max(X^,X2),  respectively. 

The  so-called  formal  Bayes  rule  with  respect  to  the  improper  prior 
density  it  —  within  the  class  of  stopping  rules  which  always  take  at 
least  three  observations  —  is  then  the  rule  which  is  conditionally  Bayes 
with  respect  to  the  proper  (i.e.,  probability)  densities  7T (a, 3 |min(X^,X2) , 
max(X^,X2) . ) .  Thus  what  Theorem  3.2  says  is  that  if  the  formal  Bayes  rule 
is  invariant  (and  symmetric  in  the  first  two  observations)  then  it  is  best 
invariant  among  rules  X  ^  3.  Similar  results  for  estimation  problems  are 
discussed  in  Kiefer  (1966) . 

3.4.  When  to  Take  Less  Than  Three  Observations 

We  need  to  know  when  a  stopping  rule  which  is  best  invariant  among 
rules  which  always  take  at  least  three  observations  can  be  beaten  (in  the 
minimax  sense)  by  a  rule  which  may  take  fewer  than  three  observations.  A 
satisfactory  result  is  the  following: 

Theorem  3.3.  Under  the  conditions  of  Propositions  3.1  and  3.3.,  if 
n  =  2  then  t =  1  or  x  =  2  is  minimax;  if  n  ^  3  and  there  exists  a  best 
invariant  stopping  rule  x^  among  rules  which  always  take  at  least  three 
observations,  then  one  of  the  following  five  invariant  rules  is  minimax 


among  all  stopping  rules: 
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(a)  t  = 

(b)  x  = 

(c)  t  E 

(d)  t  = 


n 


1 

2 


2  if  X2  <  X 


x  otherwise 
n 


(e)  t  = 


2  if  X1  <  X2 


=  x  otherwise, 
n 


Proof :  There  are  just  two  key  steps  to  the  proof.  The  first  is  to 

observe  that  any  t  which  for  some  ct  and  g  has  a  non-zero  probability 

of  taking  the  first  observation  —  i.e.,  for  which  the  Lebesgue  measure, 

y  ( • )  >  of  A  =  {x^:  x(x^,...)  =  1}  is  greater  than  zero  —  must  have  a 

maximum  risk  at  least  as  great  as  the  constant  risk  of  the  rule  =  1. 

This  is  obvious  if  contains  an  interval,  say  (c,d),  since  for  a  =  c, 

3  =  d,  the  rule  does  always  take  the  first  observation.  That  it  is  true 

in  general  follows  from  the  following  measure  theoretic  result:  If 

y(A)  >  0  and  if  e  <  1,  then  there  exists  a  finite  interval  I  such 

that  y(AnI)  >  ey(I)  (see  e.g.  Halamos  (1950),  pp.  68).  In  other  words, 

we  can  find  parameter  values  a  and  B  such  that  P  „(T  =  1)  is  as  close 

ot  9  P 

to  one  as  we  like.  Then,  from  the  boundedness  of  the  loss  functions  {W^n^} 
we  conclude  that  ( (X-a)  /  (g-a) )  will  be  as  close  as  we  like  to 

Ea,gW^)((X-a)/(B-a))  =  p^. 

What  we  have  shown  so  far  is  that  E  1  together  with  all  rules  which 
always  take  at  least  two  observations  forms  a  complete  class.  (This  completes 
the  proof  for  n=2.)  But  Proposition  3.1  implies  that  we  still  have  a  complete 
class  if  we  consider  only  invariant  rules  which  always  take  at  least  two  ob¬ 
servations.  And  it  is  easy  to  see  that  all  invariant  rules  T  ^  2  are  of  the 
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form  t  =  2;  or  T  =  2  if  X^  <  ^l’  T  =  t'  £  3  otherwise;  or  t  =  2 
if  >  X  ,  t  =  t'  ^-3  otherwise;  for  some  invariant  t '  >3. 

The  second  key  step  in  the  proof  is  to  invoke  the  symmetry  of  the  loss 
in  the  first  two  variables  in  dealing  with  the  latter  two  cases.  The  same 
simple  method  which  yields  Proposition  3.3  also  shows  that  we  can  restrict 
ourselves  to  those  t'  which  are  symmetric  in  the  first  two  variables. 

For  such  rules,  the  conditional  risk  obviously  does  not  depend  on  whether 
or  not  <  X^.  Hence  conditioning  on  {t=2},  an  event  of  probability 
1/2,  we  have 

Px  =  \  ^E0,ltW2n)(~}^=  21  +  Px')  (3'15) 

«jj» 

which  is  clearly  minimized  by  taking  T'  =  T  .  D 

3.5.  An  Algorithm  for  Risks  of  Some  Invariant  Rules 

In  Section  5  we  shall  want  to  evaluate  the  asymptotic  risk  —  in  the 
Quantile  Problem  —  of  some  non-minimax  policies  of  the  form  (3.6).  Here  we 
provide  the  tools  for  the  evaluation  by  deriving  an  algorithm  for  computing 
the  risk  of  any  policy  of  that  form. 

We  begin  with  the  following 

Lemma  3.1;  If  T  is  of  the  form  (3.6),  then,  for  each  i  £  i  , 

T  I  <  is  independent  of  the  pair  (L^,IL). 

Proof :  It  is  straightforward  to  check  that,  regardless  of  the  values 

of  a  and  Y,  the  conditional  distribution  of  the  sequence  of  (X^ -Lj  ) / (U_^-L^ )  '  s , 
j  =  2,3,...,i,  given  and  tk  ,  is  that  of  the  sequence  of  (Z^  )  /  (tk-L..  )  '  s 

where  Z^,Z^, . . . is  a  random  permutation  of  the  constants  0  and  1  to- 


21 


gether  with  i-2  IID  random  variables  each  uniform  on  (0,1),  and 

i  » 

L.  and  U.  are,  respectively,  the  smallest  and  largest  of  (Z  ,  .  ..,7, 

1  1  1  J 

Since  this  conditional  distribution  does  not  depend  on  and  U.,  the 

result  follows  immediately.  ° 


and 


From  the  Lemma  and  the  independence  of  the  two  vectors 

(L.,U.,Ir  we  have  the 

i  i  (x  £  i> 


(X1+1, 


Corollary:  For  each  i  ^  i^  the  conditional  distribution  of  the  pair 

^i’l*i^’  given  that  x  >  i,  is  the  same  as  its  unconditional  distribution. 

Hence  so  are  the  conditional  distributions  of  the  vector  (L. ,U. ,X. ,X  ) 

i’  1+1’  ’  n 

and  of 

x(l+1)  -min{j*i+l:  (X  -L..)/(U  -L..)  sj  c  }  . 

To  clarify  the  meaning  of  in  this  corollary  we  make  the  following 

definition: 

If  x  is  of  the  form  (3.6),  then  for  any  i  with 

i  ^  i  £  n,  we  denote  by  x  ^  the  policy  of  the 

same  form  as  x  except  that  i  is  replaced  by  i, 
r  •  \  n 

and  by  x^  ’  the  corresponding  pseudo-invariant  rule.  (3.16) 

In  other  words  x^^  is  x  delayed  until  i.  And  the  corollary  says  that 

/  j  \ 

the  conditional  distribution  of  x  given  that  x  £  i  is  simply  that  of  xv 
Suppose  now  that  the  losses  do  not  depend  on  the  past.  That  is,  for 
each  i. 


w^  ((x-cO/y) 


((xi~a)/y, (xi+1-a)/y, • • • , (xn-a)/y)  . 


Then,  by  the  corollary,  for  each  j  >  i  5  in 


(3.17) 
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E{w_j 


(n) 


((X-a)/y)l{  j|t 


>1}  = 


E  w^n)((X-a)/Y)I  ,.+1v 

2  ~  {T(l+1)=j} 


hence 


E{wT(n)  ( (X-a)  /Y)  |  t  >  i} 


E  w 


(n) 

T(i+1) 


( (X-a) /y) 


(i+1)  ' 


(3.18) 


(3.19) 


If  T  ^  3,  then  similar  results  hold  for  T,  the  pseudo- invariant  policy 
corresponding  to  T  in  the  Bayesian  problem.  Using  (1.2)  with  the  prior 
density  (3.11)  we  can  verify  immediately  that  the  conditional  distribution  of 
T  given  X^,...,X_^_2  is  identically  that  of  on  {f  >  i-2}  for  each 

i  ^  in>  and  that,  for  losses  satisfying  (3.17),  for  each  j  >  i  ^  i 


E{w; 
1 


(n) 


«  £,u,X»I{~=j_2} \XV  . . .  ,X._2>  =Ewjn)  «  ^,u,X>)I  ^(.+1) 


(t' 


’=3-2} 


on 


(t  >  i-2}  . 


So,  corresponding  to  (3.18)  and  (3.19)  we  have  for  each  j  >  i  ^  i 


n 


E{w^n)«£,u,X»If~  -,|T>i-2}=Ew<n)«£,u,X»I  ,  n 

3  ~  11-3-z/  3  ~  2} 


(3.20) 


and 


E(w^«  £,u,X  ))  |f  >  i-2}  =Ew["|+1)  «  &,u,X» 


(3.21) 


Now  Proposition  3.2  establishes  the  equality  of  the  right  sides  of  (3.19) 
and  (3.21);  and  the  proof  of  that  proposition  shows  clearly  that  it  holds 
"term-by-term"  (i.e.,  for  each  i  )  so  the  right  sides  of  (3.18)  and  (3.20) 
are  equal.  Thus,  given  T  ^  3  satisfying  (3.6)  and  losses  satisfying  (3.17), 
if  we  define 

-  E(w((X-a)/Y)|T>i),  i  =  i  -l,i . n-1  (3.22) 

1  1  ~  n  n 

and 

=  E(w~+2«  £,u,X  >)  |*t  >  i-2)  ,  i  =  in-l,in, . .  .  ,n-l 
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we  have  not  only  equality  of  and  for  each  i,  but  also  equality 

of  the  coefficients  used  to  compute  them  from  the  following  identities: 

<P±_- 1  =  E[wi((X-a)/y)I^T=i]  |t  >  i-1]  +  </\P(x  >  i|x  >  i-1) 

-  E[w.((X-ci)/Y)I  ]  +  ^.P(x(l)  >  i)  (3.23) 

1  ~  {T^^i} 

and 

E[w.(<  £,u,X>)I  ]  +  V  .P(T(l)  >  i-2)  . 

1-1  1  ~  {?(l)=i-2}  1 


So  there  is  a  choice  of  methods  of  evaluating  (3.23).  Either  is  straight¬ 
forward  as  soon  as  we  note  that  because  0  ^  c,  <  1  for  each  i  <  n,  we  can 
replace  (3.6)  by 


t(X)  =  min{j  £  i  :  (X.-L.  ^  )  /  (U .  -L .  J^c.} 
n  3  J-l  J-l  3-1  3 

=  n  if  no  such  3  <  n, 

the  advantage  being  that  X^  is  independent  of  and  ^j-1' 

that 

t(X)  =  min{j  >  i  -2:  (X.-L  )/(U.  .-L  J  ^  c,  } 

n  3  3+I  3+I  3+I  3+2 

=  n-2  if  no  such  3  <  n-2 , 


This  implies 


where,  for  i  ^  3, 


and 


mm{  H , u ,  X^ , ...  , X^^^} 


=  max{£,u,X^, . . . ,X^_2}  . 


(3.24) 


(3.25) 


We  have  then,  by  direct  calculation 
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P(T(l)>i)  =  P(Xi>L1_l  +  ci(Ui_l-Li_1)) 


P(?<i)>i-2)  .  P(X„2>  Li.1*c1(U1_1-Li_1)) 


=  [  (i-1)  -  (i-2)^]  /i,  i  =  in,in+l,  .  . .  ,n-l 


and  in  the  special  case  of  the  Quantile  Problem, 


r-X.-a 
1 


Ejw.((X-a)/Y)I^(1)=_}_=  E|_6-a  ^X1<L1_1  +  ci(U1_1-L±fcl)i 


=  E 


w.  (<  £,u,X>)I 

~  i^=i-2p 


=  E 


rXi_2~a 


{X_2«  Li.1  +  Ci(ui_1-L1_1)L 


t  i  +  (i-2)c±+-|  (i-1)  (i-2)c^| /i(i+l)  ,  i  =  in> in+1 , .  .  .  ,n-l  , 


and 


Vi '  E(Va)/Y  - 1  • 


Substituting  back  into  the  right  side  of  (3.23)  yields  the  following 
algorithm  for  the  risk: 

Proposition  3.4:  In  the  Quantile  Problem,  if  T  is  of  the  form  (3.6), 
then  its  risk,  p  ,  equals  where 


V±_1  =  [  1  +  (i-2)ci  +i  (i-1)  (i-2)c?]  /i(i+l)  +<p^.  (i-l)-(i-2)ci] /i. 


i  . 
n 


i  ri-1  j  ti  2  j  » j 


(3.26) 
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4.  MINIMAX  RULES 

Now  we  specialize  to  the  Best  Choice  and  Quantile  Problems  for  which, 
as  previously  noted,  the  conditions  of  Propositions  3.1  and  3.3  are  satisfied; 
so  Theorems  3.1,  3.2,  and  3.3  are  at  our  disposal. 

When  k=2  in  (1.1)  —  so  the  a  priori  density  is  given  by  (3.11)  — 
Stewart's  (1978)  results  are  as  follows: 

In  the  Best  Choice  Problem  the  Bayes  rule  with  n  -  2  available  observa¬ 
tions  is 

=  min{i £ max{i(n)  -  2,1):  =  min{£,X^ , . . . jX^}} 

=  n-2  if  no  such  i  <  n-2  (4.1) 

where  i(n)  satisfies  (2.12). 

In  the  Quantile  Problem  the  Bayes  rule  with  n-2  available  observations 
is 

Tn  =  min{i*i(n)  -2:  (xi"£i+2) /(^i+2^i+2}  ^ci+2(n>}  (4.2) 

where  and  XL  are  given  by  (3.24)  and  (3.25),  respectively,  and  the 

c^(n)'s  and  i(n)  satisfy  (writing  c^  for  c^(n)): 

c  =  1 
n 

Ci-1  =  Cid-V2)  -  (i+l)~1ci(l-ci)  -  (i-2)'1(i+l)~1(l-ci)  (4.3) 

and 

n  -  i(n)  =  min{j  $  n  -  3:  c^  ^  ^(n)  <  0} 


=  n  -  3  if  no  such  j  . 


(4.4) 
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The  invariant  rules  corresponding  tp  these  Bayes  rules  are  of  the  form 

(3.6),  and  those  for  the  Best  Choice  Problem  are  also  of  the  form  (3.9)  with 

* 

k.  =1  for  i  <  n.  Thus  in  the  Best  Choice  Problem  the  t  in  Theorem 
l  n 

3.3  is  a  rule  based  only  on  relative  ranks.  Hence  so  are  all  five  of  the 
rules  there.  We  therefore  have 


Theorem  4.1:  In  the  Best  Choice  Problem,  the  best  stopping  rule  based 
only  on  relative  ranks  is  minimax. 

From  (2.12)  it  is  easy  to  check  that  in  the  Best  Choice  Problem 
i(2)  =  1,  i ( 3)  =  i(4)  =  2  and  i(n)  >  3  for  n  ^  5. 

In  the  Quantile  Problem  the  risks  of  the  five  stopping  rules  in  Theorem 
3.3  are  p  A,  1/2,  1/2,  j  (1/3  +  p  *) ,  and  j(2/3  +  p  A) ,  respectively. 


n 


n 


n 


(This  follows  from  invariance  plus  the  fact  that  if  and  are  IID 

uniform  on  (0,1),  then  EZ^  =  EZ^  =  1/2,  E(Z2|Z2<Z^)  =  1/3,  and 

E(Z„ I Z„ >  Z_ )  =  2/3.)  Since  p  .  must  be  less  than  1/2  (the  risk  of  the 

rule  t  =  3)  only  the  first  and  fourth  of  these  five  stopping  rules  are 

admissible.  Now  p4  is  decreasing  in  n  simply  because  the  class 

Tn 

{t:  3  £  t  £  n}  is  increasing  in  n.  Thus  the  minimax  stopping  rule  will 

be  completely  determined  as  soon  as  we  find  the  smallest  n  for  which 

p  *  $  1/3. 

Tn 

This  is  easily  obtained  using  the  fact  that,  by  Theorem  3.2,  p  .  is 

Tn 

also  the  Bayes  risk  of  its  pseudo-invariant  counterpart  Tn  given  by  (4.2). 


That  is,  for  each  n  £  3, 


PT* 

n 


=  b 


n-2 


(4.5) 


and 


mn  =  min{bn_2,  |(|+  bn_2)} 


(4.6) 
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where 

=  minimum  Bayes  risk  in  Quantile  Problem 
with  n  available  observations  and 

a  priori  density  given  by  (3.11)  (4.7) 

and 

m  =  minimax  risk  in  Quantile  Problem  with 
n 

n  available  observations.  (4.8) 

Stewart's  proof,  based  on  backward  induction,  provides  what  we  need.  Let 


^n)  =  inf{T:1_2<^n_2}E[(XT-a)/(6-a)!xi . X._2] ,  i  =  2,3,...,n-l  (4.9) 


so 


^(n)  =  b 
z  n-2 


(4.10) 


As  Stewart  showed,  the  and  the  c^(n)'s  are  related  by 


=  (i+1)  1  { (i-l)c^(n) + 1}  i(n)  -  1  £  i  £  n  -  1 


(4.11) 


and 


*<">  .  #<”>  .....  *<") 


(4.12) 


i(n)-l 

[PUT  TABLE  1  HERE] 

Table  1  uses  (4.3),  (4.6),  (4.10),  and  (4.11)  to  get  the  explicit  solution 

to  the  Quantile  Problem  for  n  £  6.  We  can  see  from  the  table  that  the  smallest 

n  for  which  p  .  =  b  _  $  1/3  is  n=6.  With  these  numerical  results  we  can 
x*  n-2 


n 


now  state 


Theorem  4.2:  In  the  Quantile  Problem,  for  all  n  ^  6,  m  =  b 

n  n— 

the  minimax  stopping  rule  based  on  n  available  observations  is 


x 

n 


min{i  ^  i(n) : 


(X.-L^/GJ.-L.) 


£  c , (n) } 

l 


and 
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where  L.  =  min{X, , . . . ,X . } ,  U.  =  max{X.,  ,  . . .  ,X . }  and  the  c.(n)'s  and 

1  lii  l  i  l 

i(n)  are  given  by  (4.3)  and  (4.4),  respectively.  For  n  =  3,  4,  or  5, 
the  minimax  rule  is  T  =  2  or  according  as  X^  <  X^  or  X^  >  X^ . 

And  for  n  =  2 ,  T  =  1  is  minimax. 
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5.  ASYMPTOTIC  MINIMAX  RISKS 

Since  the  minimax  rule  in  the  Best  Choice  Problem  is  just  the  best 
relative  rank  rule,  its  asymptotic  risk  is  1  -  e  3  ~  .6321,  from  (2.13). 
(This  limit  should  be  compared  with  the  value  =;  1  -  .5802  =  .4198  from  (2.9) 
which  is  the  asymptotic  risk  when  the  distribution  is  known.) 

For  the  Quantile  Problem,  we  shall  prove  the  following  theorem: 

i/yr 

Theorem  5.1:  lim  nm  =  (3  +  2/Y)  ,v  -  3.4780  . 

-  n  ->  oo  n 


(This  limit  should  be  compared  with  lim  nv  =  2  and  lim  nr  ~  3.8695, 

n->oon  n->oon 

from  (2.3)  and  (2.11),  which  are  the  asymptotic  risks  when  the  distri¬ 
bution  is  known,  and  when  the  best  relative  rank  rule  is  used,  respectively.) 


5.1.  Preliminary  Results 

We  collect  here  results  which  will  be  needed  for  the  proofs  of  Theorems 
5 . 1  and  6.1. 

Taking  =  n/(n+l)  for  convenience,  and  re-writing  (4.11)  as 


ci(n)  =  (i-1)  1  {(i+l)4ijn^  -  1),  i(n)  -  1  £  i  g:  n 


(5.1) 


we  can  then  re-write  (4.3)  in  either  of  the  following  forms: 

^i-1  =  2 (i-1)  (i+1)  +  (1-  i(i-l)  ^  ~2  ^  "  i(i-l)  )(^i  ^  ’  l(n)  ^  1  ^  n  (5*2) 


or 


,(n>  _ ,,,  (n)  g-  (i~2)  (i+1)  j  / .  (n)  1  x  , .  (n)  i  u  .  /n  .  ,,-ox 

;i  ^i-1  2i(i-l)  i^i  i+l)(^i  +  (i-2)  (i+1)  H  ’  (  )  $  *  (5*3) 


Also  (4.4)  becomes 
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n-i(n)  =  min{ j  g:  n-3:  <  (n-j)  1} 


=  n  -  3  if  no  such  j  . 


(5.  A) 


From  (5. 2) -(5. 4)  we  can  show  that 


i(n>  -  f<C)-irl] 


=  [m  'H  for  n  5  6 
n 


(5.5) 


(Here  [x]  denotes  the  integer  part  of  x.)  Hence  i(n)  is  non-decreasing 
in  n,  a  fact  which  can  he  strengthened  as  follows:  Since  the  right  side  of 
(4.3)  is  increasing  in  both  i  and  c^  it  follows  that 


Vjw  f 


in  n 


for  each  j  =  1 , 2 , . .  .  and  n  ^  i (n) +  j 


(5.6) 


Combining  this  with  (4.4)  we  conclude  that  i(n+l)  £  i(n)+l.  Thus,  for  each 
n, 

i(n+l)  =  i(n)  or  i(n)  +  l  .  (5.7) 


We  also  need  to  know  that 


t  <  1  =*  sup  sup.  nil^n)  < 

Fn  *i  g:  nt  ri 


(5.8) 


Here  a  probabilistic  argument  is  simpler  than  an  analytic  one.  Using  (2.11) 
and  the  fact  that  is  no  greater  than  the  risk  of  any  rule  which  totally 

ignores  the  first  n-i  observations,  we  have 


(n-i)4jfn)  ^  (n-i)m  .  <  (n-i)r  .  <  4 
l  n-i  n-i 


for  which  (5.8)  follows  immediately. 
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5.2.  Proof  of  Theorem  5.1 

o 

By  (5.8)  we  can  multiply  both  sides  of  (5.3)  by  n  ,  let  n  <»,  and 
conclude  that  for  i  >  i(n) 

i/n  -y  t  =*  ->  f(t)  (5.9) 

where  f  satisfies  the  Riccati  equation 

f’(t)  =  (1/2)  (f2  (t)  -t2)  (5.10) 

on  (0,1)  with  the  boundary  condition  f(l  )  =  oo.  And  from  (5.5)  we  can  also 
conclude  that 

i(n)/n  -y  t*  (5.11) 

and 

nm  -y  f(t*)  ,  (5.12) 

n 

t*  being  the  t  for  which  f(t)  is  minimized  and  where  f(t)  =  t 
The  change  of  variables 

f(t)  =  -2g’(t)/g(t)  (5.13) 

in  (5.10)  yields  the  linear  equation 

4t2g"(t)  =  g(t) 

which  has  the  general  solution 

gCt)  - 

The  boundary  condition  f (1)  =  °o  implies  a  =  b  which  uniquely  determines  f. 
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The  result,  from  (5.13),  is 


f(t) 


(l  +  /2)(3-  2/2  +  t/2) 
t(l-  t1^) 


which  attains  its  minimum  at 

t*  =  =  1/(3  +  2/2)3V^  =  Vf(t*)  ~  .2875. 

Applying  (5.12)  completes  the  proof.  D 

From  (5.1),  (5.9),  and  (5.14)  we  obtain  the 
Corollary:  If  i/n  t  >  t*,  then 


(5.14) 


✓2 


(n-i)c.(n)  -  (l-tHfOO-t*1)  ,^H)V --(3.12/2))^ 


td-t75') 


(5.15) 


The  right  side  of  (5.15)  increases  to  2  as  t  increases  to  1.  Table  2 
gives  some  values. 


[PUT  TABLE  2  HERE] 


5.3.  A  Simplified  Sequence  of  Policies 

The  corollary  to  Theorem  5.1  shows  that  when  n  is  large  the  minimax 
policy  is  to  let  approximately  t*n  (~  .2875n)  observations  go  by,  then  stop 


as  soon  as 
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(Xi-Li)/(U±-Li)  £  an((n-i)/n)/(n-i) 


(5.16) 


where  a^(t)  is  approximately  the  right  side  of  (5.15),  so  a^(t)  t  2  as 
t  increases  from  t*  to  1.  Not  surprisingly,  this  is  similar  to  what 
happens  when  the  distribution  is  known.  By  (2.3)  vn_^  is  approximately 
2/(n-i)  for  large  n.  Indeed  it  can  easily  be  checked  that  replacing  the 
right  side  of  (2.1)  by  2/ (n-i)  does  not  increase  the  asymptotic  risk.  It 
is  therefore  of  interest  to  see  what  happens  if  we  correspondingly  alter  the 
minimax  policies  by  replacing  c^(n)  by  2/ (n-i)  and  i(n)  by  a  free 
variable  chosen  so  as  to  minimize  the  risk. 

Substituting  c^  =  2 (n-i)  ^  into  (3.24),  writing  for  and 

re-arranging  terms,  we  can  show  that  if  i/n  -*■  t  as  n  ->  <»,  then 
n</^n^  -*■  f(t)  when 


f*(t) 


1+t 

t(l-t) 


(f(t)  - 


1+t 


(l-t2) 


). 


2 

and  f  satisfies  the  boundary  condition  (l-t)  f(t)  -*■  0  as  t  -*■  1.  The 
unique  solution  is 


f(t) 


(l-t2)  +2t2|£nt| 
2t (l-t)2 


(5.17) 


Thus  if  we  take  i  /n  -»■  t,  the  risk  is  asymptotically  n  ^f(t).  The  best 
n 

choice  of  t  is  the  one  for  which  f(t)  is  minimized.  By  numerically 
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evaluating  f  we  find  that  its  minimum  value  is  about  3.8215,  attained  at 
t  x  .3225. 

Comparing  this  value  of  3.8215  with  the  corresponding  values  of  3.4780 
for  the  minimax  policies  and  3.8695  for  the  best  relative  rank  rules,  we  see 
that  replacing  c^(n)  by  2 (n-i)  ^  yields  policies  which  are  sub-optimal 
but  still  manage  to  outperform  the  best  relative  rank  rules  by  a  whisker. 
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6.  MONOTONICITY  PROPERTIES  OF  THE  MINIMAL  RISK 

(This  section  is  devoted  entirely  to  the  Quantile  Problem.) 

All  of  the  minimal  risk  sequences  which  have  been  introduced  —  v 

n 

and  r  in  Section  2  and  m  and  b  in  Section  4  —  are  decreasing 
n  n  n  e 

in  n  simply  because  the  bigger  n  is  the  larger  is  the  class  of 

available  stopping  rules.  Multiplying  the  minimal  risk  by  n,  however, 

produces  an  increasing  sequence.  This  was  in  effect  shown  for  nr^  by 

Chow  et  al.  (1964),  as  noted  in  (2.11).  The  method  used  there  was  to 

condition  on  the  arrival  time  of  the  worst  of  n+  1  arrivals,  which  leads 

to  a  randomized  relative-rank-based  rule  for  n  arrivals  such  that  —  in 

the  notation  of  this  paper  —  (n+1)  times  its  risk  is  less  than  (n+ 2)r  ... 

n+1 

It  is  quite  easy  to  show  that 

nvn  +  in  n,  (6.1) 


and  two  proofs  will  be  given  in  Section  6.1:  a  purely  analytic  one  and  one 
using  the  method  of  Chow  et  al.  The  latter  proof  will  then  be  generalized 
in  Section  6.2  to  yield  the  main  result  of  this  section: 

Theorem  6.1:  nm  /in  n. 

-  n 

(We  should  remark  that  nb  is  also  increasing  in  n  since  b  =  m 

n  n  n+2 

for  n  4  and  monotonicity  for  smaller  n  can  be  checked  using  Table  1.) 


6.1.  Monotonicity  of  nv 


n 


The  analytic  proof  starts  with  the  recursion  (2.2).  (Note  that  if  we 


let  R  =  1 
n 


v  in  (2.2),  then 
n 


R 


n+1 


I  <1+Rn)! 


E1  "I 


36 


which  is  Gilbert  and  Mosteller's  formula  (5a-l).)  Now  x(l-x/2)  is 
increasing  on  (0,1)  so 


2  2 
n  n+1  n+1  n+1 


(1-^-)  =  -^^  2 


n+1  '  ,  ,  .  .  2  n+2 

(n+1) 


Since  $  2/ (1+1)  we  have  by  induction 


v  $  — —  for  all  n: 
n  n+1 


hence,  from  (2.2)  again, 


v 


n+1 


*  Vn^  -  ^+1 


(6.2) 


which  is  (6.1).  n 

For  the  second  proof  we  first  take  a=0,  B=l>  purely  for  notational 

convenience.  Now  suppose  we  are  told  both  the  value,  Un+^,  and  the  arrival 

time  of  the  maximum  (i.e. ,  worst)  of  X^,...,Xn+^.  Then  clearly  we  can 

improve  on  vn+^  by  using  an  optimal  policy  on  the  other  n  X_/s,  which 

are  conditionally  IID,  uniform  on  (0,U  +^) .  The  optimal  risk  is  then 

conditionally  v  U  hence  unconditionally  v  E  U  =  v  (n+1)/ (n+2). 

n  n+1  n  n+l  n 

Thus , 

n+1 

n+2  Vn  v  Vn+1 

which  is  slightly  stronger  than  (6.2).  D 
6 . 2  Monotonicity  of  nm^ 

Now  let  us  "tie  one  hand  behind  our  back"  by  insisting  that  only  in¬ 
variant  rules  be  allowed  on  the  other  n  X’s.  It  is  not  hard  to  see  that 

the  best  such  rule  does  not  depend  on  U  and  its  conditional  risk  is 

n+1 
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just  m  U  But  the  very  fact  that  the  rule  does  not  depend  on  U 

J  n  n+1  -  F  n+1 

prevents  us  from  asserting  at  once  that  it  is  an  improvement  on  m 

n+1 

(If  so,  then  we  are  done,  as  before.)  Here  is  where  the  work  begins. 

First  we  check  directly  from  Table  1  that  nmn  is  increasing  for 
n  £  6.  So  henceforth  we  will  take  n  ^  6.  Since  the  minimax  rules  have 
constant  risk  we  can  simplify  the  notation  by  taking  a=  0,  3=1.  Let  us 
also  abbreviate  T  the  minimax  rule  with  n+1  observations,  by 

simply  T. 

Now  we  define  the  random  variable  Z  by  X  =  U  ,  and  let 

Lt  ill"  _L 

I 

Ui  =  U.  if  i  <  Z 

=  max{Xj :  j  £  i,  j  ^  Z}  if  i^Z. 


Then,  noting  that  T  =  Z  only  if  Z  =  n+1,  we  let 


°  Tl{x<Z}  +nI{T=Z=n+l}  +  aZI{T>  Z;  Z  <  n} 


where  for  1  $  z  £  n. 


az  =  min{i  >  max{z ,  i(n)  }  :  (X^-L^) /  (U^-L^)  <  ci—1  Cn) } 


Thus,  on  {x  >  Z;  Z  £  n},  a  is  essentially  the  minimax  rule  with  n 

observations  except  that  it  ignores  X^  entirely  and  treats  ^+1’ " * • ’^n+i 

as  though  they  were  X^.,  •  •  •  >^n*  Conditional  on  a  is  in  effect  a 

randomized  invariant  rule  applied  to  n  IID  random  variables,  uniform 

on  (0,U  . , ) .  Hence 
n+1 


E(X|U  ..)  =  mU 
a 1  n+1  n+1 
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where  the  constant 


Since  EX^ 


m 


h+1 


that 


m  is  at  least  as  large  as  Thus 

EX  S;  m  EU  =  m  (n+l)/(n+2)  . 
o  n  n+l  n 

we  see  that  to  prove  Theorem  6.1  it  suffices  to  show 


EX  £  EX  . 
a  T 


Clearly 


V{t<Z}  XtI{t<Z} 


and,  letting  A  =  {t  =  Z  =  n+l}. 


X  I.  = 
a  A 


XI,  < 
n  A 


X  - 1 .  = 
n+l  A 


X  IA 
T  A 


So,  what  remains  to  be  shown  is 


EV{x>z, 


Z  $  n} 


EX  I, 

T  {t  >  Z, 


Z  $  n} 


(6.3) 


We  do  so  by  showing  that  the  corresponding  inequality  holds  identically  on 
{Z  £  n}  for  the  conditional  expectations  given  Z  and  And,  to 

simplify  matters,  we  divide  both  conditional  expectations  by  which 

eliminates  the  dependence  on  Un+^.  Specifically  (6.3)  is  implied  by 


E<XaU«lI{T>z}lZ-U„+l)  *  E<XtD 


!  ■*■  I 

n+l  (t  >  Z> 


identically  on  {Z  £  n} 


(6.4) 


where  both  sides  of  (6.4)  are  functions  of  Z  only. 
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We  can  describe  these  two  functions  of  Z  as  follows:  Let 
Y1,Y2’ *  *  * »^n  random  variables,  each  uniform  on  (0,1);  let 

f 

and  LL  be  the  smallest  and  largest,  respectively  of  Y^,...,Y^;  also 
define 


V. 

l 


=  L^_1  +  ci(n)(U^_1-L^_1) 


and 


f 

V. 

1 


Li-1 +  ■=1+1<n+1>(1-I',i-1) 


For  each  z  £  n  let 


and 


o(z)  =  min{i  ^  max{z,i(n)}:  Y^  g:  V^} 
t(z)  =  min{i  >  max{z, i(n+l)-l} :  Y^  g:  V^}  . 


Then,  on  {Z=z},  the  left  and  right  sides  of  (6.4)  are  EY  .  .  and  EY  ,  . 

o( z)  t(z) 

respectively. 

Now  we  are  in  a  position  to  establish  (6.4)  by  backward  induction.  Let 


Then 


and 


=  $±(z)  =  E(Ya(z) la<z)  >  *■> 

6i  =  Si(z)  =  E(Yt(z)It(z)  >  i)  • 


A 


n-1 


n-1 


1 

2 


*z-l(z)  ■  EYa(z)’ 


6  1(z)  =  EY  .  . 

z— 1 v  T (z) 


(6.5) 


(6.6) 


Moreover 
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,  (n) 

■  h-1 

if  max{z, i(n) }  ■<  i  £  n-1 

A 

=  K-i 

if  z  $  i  <  i(n)  , 

(6.7) 

where  is  defined  by  (4.9);  also 


=  EY^I|y  _  <  y_ }  +  <5,.P(Y_.  >  V^)  if  max{z,  i(n+l)-l}  £  i  £  n-1 


=  6 


z-1 


if  z  $  i  <  i(n+l)  -  1 


(6.8) 


By  (6.6)  the  proof  will  be  complete  when  we  show  that 


6  .  2  ib .  for  i  =  n-1  ,n-2 ,  .  . .  ,  z-1 
l  i 


(6.9) 


Equality  holds  for  i  =  n-1  by  (6.5)  so  we  can  proceed  by  induction.  First 
we  need  to  evaluate  (6.8).  Independence  of  Y^  and  V^,  together  with  the 
moments  El/_^  =  i  ,  =2i  (i+1)  ,  gives 


Ji-1 


1  »2  i 

~  EV.  +  6 . E(l-V . ) 

2  l  l  l 


1+  (i-l)ci+1(n+l) +y  i(i-l)  (ci+1(n+l))2 
_____ 


+(1-ci+l(n+1))(i=i)  6. 


=  6 


z-1 


if  max{z,  i(n+l)-l}  ^  i  g  n-1 
if  z  £  i  £  i  £  i(n+l)  -  1. 


For  0  c  c,x  £  1, 


let 
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1+  (i-l)c+-j  i(i-l)c2 

di-1(C’x)  =  iTi+T) - +  (1"c)  V  x 


and 


l+(l-2)c+l  (i-l)(l-2)c2  pi-1)  _  (1-2)c 

8i-r  ’  J  i(i+l)  L  i 


X 


Then 


6i_1  =  d±_x (c£+1  (n+1)  > 6^)  if  max{z , i(n+l)  -  1}  £  i  £  n-1 


and,  by  (3.26)  and  (6.7), 


$i_1  =  g^_^ (c±(n)  ,$_. )  if  max{z,  i(n)  }  ^  i  ^  n-1 


Also,  by  direct  calculation, 

di-l(c’x)  '  -j—y  c(c  -  <1+11>.X1  '  1  ),  (6.10) 

which  is  clearly  non-negative  if  and  only  if  c  ^  [ (i+l)x  -  1] / (i-1) . 

We  now  assert  that,  for  i  >  max{z,i(n+l)  -1},  6^  £  implies 

6i-l  *  di-l(ci+l(n+1)’*i)  *  8i-l(cl+l(n+1)^i)  *  k-1  •  (6*n) 

The  first  inequality  holds  by  the  induction  hypothesis  because  d^  is  in- 
creasing  in  x.  The  second  inequality  holds  by  (6.10),  (5.1),  and  (5.6)  if 
i  ^  i(n) ;  but  by  (5.7)  the  only  remaining  case  is  z  $  i(n)  =  i(n+l) ; 
i  =  i(n+l)  -  1,  in  which  case  the  inequality  holds  by  just  (6.10)  because 
then  [(i+l)i(>  -  l]/(i-l)  <  0.  Finally  the  last  inequality  of  (6.11)  holds 


because 
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mlnos«i  g1-i<c-'(,i)  ■  h-1 

if 

i  5  i(n) 

A  A 

>  lp.  =  lp.  - 

1  rl-l 

if 

i  =  i(n)  - 

Thus  (6.11)  shows  that  <5.  5  $.  where 

3  3 

2  =  max{z,i(n+l)  -  1}  $  max{z;i(n)}  . 


Referring  to  (6.7)  and  (6.8)  we  see  that  6  .  =  6.  and 

z-1  2 

(6.9)  holds  and  the  proof  of  Theorem  6.1  is  complete.  D 


ip .  .  Thus 
y3 
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APPENDIX  A 


Derivation  of  (2.10):  Without  loss  of  generality  we  may  assume  that  the 
underlying  distribution  is  uniform  on  (0,1);  and  for  compatibility  with  the 
work  of  others  we  shall  pose  the  equivalent  problem  of  maximizing  the  probabi¬ 
lity  of  selecting  the  largest  of  X^,X2» . . . jX^.  So  the  optimal  policy  is  to 

stop  as  soon  as  X.  =  max(X.  ,  X0, . . . ,X.)  and  X.  >  1  -  d  .  where  d~  =  1 
r  i  1  2*  l  l  n-i  0 

and,  for  i  <  n,  d  .  is  defined  by  (2.5).  For  convenience  we  let 
>  5  n-i  J 


and 

=  max(X^, . . . ,X^)  . 


It  should  be  noted  that  Gilbert  and  Mosteller's  argument  for  the 
optimality  of  this  policy  was  heuristic.  A  rigorous  argument  can  be  con¬ 
structed  either  by  dynamic  programming  or  by  a  Markov  chain  agrument.  The 
latter  is  provided  by  Bojdecki  (1978)  who,  in  effect,  uses  the  easily-verif iable 
identity 


i  Q)(l-x)J 

l  ^ - T - 

3=1  3* 


(x  j  -1) 

3 


(A.  1) 


to  re-write  (2.5). 

We  now  let  X-^^^,...  be  an  infinite  sequence  of  IID  random  variables 
each  uniform  on  (0,1),  let 


T  =  min{i:  X.  =  M  ^  b  .}  , 
n  ii  n-i 

so  is  the  optimal  policy  based  on  n  prices,  and  let  on  and  on  l>e 


defined  by 
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X  =  M  and  X  .  =  M 

an  a  a  -1 

n  n  n 


so  a  is  the  "arrival  time"  of  the  largest  of  the  first  n  prices  and 
n 


an  is  the  arrival  time  of  the  largest  price  prior  to  the  an~th.  Then, 


because  b 


n-i  \  1 


in  i  while  M. 


t  in  i, 


we  can  write 


P(X  =  M  )  =  P(M  >  b  and  M  .  <  b  ») 

x  n  n  n-a  a  -1  n-a 

n  n  n  n 


(A. 2) 


Now  we  make  the  change  of  variables: 


Z  =  n(l-M  );  T  =  a  /n 
n  v  n  ’  n  n 


z'  =  (a  -  1) (1  -  M  _/M  );  t'  =  a'/(a  -1) 
n  n  a  -1  n'  n  n  n 

n 


and  use  (2.6)  so  that  (A. 2)  becomes 


P(X  =  M  )  =  P (A  n  B  ) 
T  n  n  n 

n 


(A. 3) 


where 


A  ={z(l-T+n 
n  1  nv  n 


-1 


Cn(l-T  <  Cn(l-T  ) ^ 
n  n 


and,  letting 


K  =  n(l  -  T  t')  +  t'  , 
n  n  n  n 


-1 


B  =  {cv  <  [Z  +  (T  --)  "(1-n  1Z  )  Z '  ]  [1  -  T  T'+n_1(c^  +t')]} 
n  1  K  n  n  n  n  nJ  L  n  n  K  n  JJ 

n  n 


'  -1. 


Using  familiar  properties  of  the  uniform  distribution,  one  can  verify 


the  weak  convergence  result 
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.  .  ® 

(Z  ,Z  ,T  , T  )-*•  (Z.Z'.T.T') 
n’  n*  n  n 

where  Z,  Z',  T,  and  T'  are  as  described  following  (2.10).  This  in  turn 
implies  that  (A.  3)  converges  to  (2.10)  as  n  ->  «>. 

To  get  from  (2.10)  to  (2.9)  we  first  condition  on  Z  =  z,T  =  t,  and 
T  =  t f ;  the  conditional  probability  is 

-t(c/(l-tt ')  -  z)+ 

(z  <  c/ (1-t)}  ' 

Integrating  this  multiplied  by  the  exponential  density  of  Z  yields  the 

$ 

conditional  probability  given  T  =  t  and  T  =  t ’  which  is 

(l-t)_1  e-ct/(1-tt,)(1_e-c(1-t:)/(1-tt,))  +  e-c/(1-tt,)  _e-c/U-t)  _ 

The  final  step  of  integrating  this  expression  over  the  unit  square  requires 
the  change  of  variables 

u  =  (l-t)/(l-tt’) ,  v  =  l/(l-tt') 

on  all  but  the  last  term.  This,  with  the  help  of  (2.8)  and  the  identity 

f1  u_1(eCU-  1)  =  l  ck/klk, 

0  k=l 


yields  the  expression 

f°°  r1  -2.  .-1  -c(v-u) 

J  J  v  (v-u)  e  dudv  . 

0  0 

Letting  w  =  v-u  and  interchanging  the  order  of  integration  then  leads 
directly  to  (2.9)  which  can  easily  be  numerically  evaluated  from  the  identity 
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0° 

r  -1  -cx, 

J  X  e  dx  = 

1 


where  y  is  Euler's  constant 


I  log  c |  -  y  -  l  (-c)j/j!j 
j=l 


~  .577216. 
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APPENDIX  B 


Proof  of  Proposition  3.1:  (This  proof  is  nothing  but  Kiefer's  proof 
(1957;  p.  588-589)  of  a  much  more  general  theorem,  written  out  in  longhand 
for  this  special  case,  and  deleting  those  aspects  of  the  general  proof  which 
become  extraneous  in  this  case.  The  proof  uses  the  notation  introduced  in 
Section  3.) 

The  key  step  in  the  proof  is  this:  Given  any  stopping  rule  t  ^  2, 
we  use  it  to  define  a  large  collection  of  corresponding  invariant  rules  as 
follows:  For  each  real  a  and  positive  y  let  be  the  stopping  rule: 


T  (x)  =  x(a+y(x-x1)/ ^-xj) 


(Note  that  if  {x:  x(x)  =  1}  were  non-empty,  then  x  (x)  would  fail  to  be 

Oi  y  ~ 

a  stopping  rule.) 

+ 

If  we  now  let  Q  =  R  x  R  denote  the  parameter  space  and  y  be  any 
probability  measure  on  Borel  sets  of  0,  then  —  simply  because  a  maximum  is 
at  least  as  large  as  an  average,  which  in  turn  is  at  least  as  large  as  a 
minimum  —  we  have 


sup  „  p  (a,y)  £  /  P  (a,y)y(da,dy) 
w  l  0  x 


(B.l) 


and 


J  Pxa,yP(da’dY>  *  infR  Px, 


e 


G  Ta,y 


(B.  2) 


Hence,  if  we  could  find  a  y  for  which  the  right  side  of  (B.l)  is  equal  to  the 
left  side  of  (B.2),  the  proof  would  be  complete.  This  is  not  unreasonable  to 
hope  for,  because,  as  will  be  seen  below,  the  requirement  is  that  y  be  right 
invariant,  i.e.,  that  for  each  G  C  G  and  (a,y)  G  0,  y(G  o  (a,y))  =  y(G), 


where 
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(a',Y')  o  (a,y)  =  (a' +  ay' , y' y)  (B.3) 

and 

G  o  (a, y)  =  {(a',Yf)  0  (a>y):  (a'.y')  e  G}  .  (B.4) 

A  right  invariant  measure  does  indeed  exist,  but  it  is,  unfortunately, 
necessarily  an  infinite  measure;  e.g.,  7r.Q(da,dY)  =  y  ^dady  is  one  such, 
as  can  easily  be  checked  (see  Zacks  (1971),  pp.  334-335).  Hence  we  must  resort 
to  showing  that 

/  (pT(d»y)  ~  PTa,Y)lTm(doi’dY)  =  °  •  (B-5) 

where 

it  (G)  =  irn(GG  )/tt0(G  ) 
m  u  m  0  m 

and  the  G  '  s  are  successively  larger  subsets  of  0  with  finite  ir_  measure, 
m  0 

Let  =  [0,lln  and,  for  each  z  6^  ,  let 
n  L  J  ~  n 

G±(z)  =  {(ct,y)  e  0:  r(a  +  yz)  =  i}  ;  (B.6) 

so  G^(z) jG^z) , . . . ,Gn(z)  is  a  measurable  partition  of  ©  .  It  follows  from 
(B.3)  and  (B.4)  that  for  any  (a,b)  e  ©, 

G_l(z)  o  (a, b)  =  {(a,y)  6  0:  f(a  +  y(z-a)/b)  =  i}  .  (B.7) 


Now,  using  the  right  side  of  (3.4),  then  interchanging  the  order  of 
integration,  the  right  side  of  (B.l)  becomes 


J  p(da,dy)  J 

6  W 

n 


n 

l 

i=2 


Wi(5)I{z:T(a+Yz)=i>d~ 


l  ! 

i-2 


wi(z)p(Gi(z))dz  . 


(B.  8) 
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And,  from  (3.10)  —  with  T  =  T  —  followed  by  an  interchange  of  the 

a,  y 

order  of  integration  and  (B.7),  the  left  side  of  (B.2)  becomes 


/  y(da,dy) 
0 


±|2  Wi^~^I{z:t(a+Y(z-z1)/|z2-z1 


-  £  L 

1=2  W 


wi(z)y{Gi(z)  o  (zlS |z2-z1|)}dz  . 


(B.9) 


Comparing  (B.8)  and  (B.9)  we  see  that  right  invariance  of  y  does  indeed 
imply  equality  of  the  right  side  of  (B.l)  and  the  left  side  of  (B.2).  Take 
y  =  tt  q  =  y  ^dady,  and 

G  =  {(a, y)  e  0:  lot!  £  A  ,  0  <  b  £  Y  <  B  <  °°} 

m  1  1  m  m  m 

where  A  /  <»,  b  i  0,  6  /  °°  and  B  / £n  B  =  °(A  )  .  Define  7r  by 

m  m  "  m  m  m^m  m  7 

7r(G)  =  7Tn  (GG  )/TTn(G  )  . 
m  U  m  U  m 

Then,  from  (B.8)  and  (B.9), 

/  |p  (o,y)  -  pT  |tt  (da,dy) 

g  *■  Y  m 

n 

=  I  /  W.  (z)  |ir  {G  (z)  -  7T  {G.  (z)  °  (z,  ,  |z„-z  I  )}|  dz  .  (B .10) 

i=2W  m  J-zj. 


Now,  denoting  (z^,  j z 2 — z )  by  g(z),  we  have 

■jrn({G.  (z)  o  g(z)}  n  G  )  =  7Tn(G.  (z)G  o  g(z)) 
Ui~  m  ui~m 

+  7T0({(G.(z)  -  Gm)  O  g(z)}  n  Gm) 

-  VGi<2>Gm  °  -  G„) 


(B.ll) 
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The  first  term  on  the  right  side  of  (B.ll)  equals  u,.(G.(z)G  )  because 

U  1  ~  m 

7Tq  is  invariant.  The  second  and  third  terms  are  at  most 

V{Gm  °  S(Z)  }  0  Gm}  =  71 0 (Gm  "  °m  °  g(5})’  and  V{Gm  °  g(^)}  "  V 

respectively.  Thus  the  right  side  of  (B.10)  is  at  most 


n 


l  !lwi(-)|l 

i-2  K 


1T0(GmA{Gm°  g(5)}) 


VGJ 

U  m 


dz  , 


where  [|w.(*)||  =  sup  w.(z)  which  is  finite  by  assumption.  So  to 
i  ok „  i 

establish  (B.5)  all  we  need  to  show  is  that,  for  each  z  6 ^  with 
|z  -zx[  >  0, 


it  (G  A{G  o  g(z)l) 
0  m  m  ~ 

TTn  (G  ) 

0  m 


0  • 


Using  (B.3)  with  (a,y)  =  (z^.lz^-z^l)  we  can  verify  that 


GmA{Gm  °  g(?)}  C  {lal  «  V  bm  *  ^  *  hJ 

U 

{|a  -  Am|  £  jzjy,  bm  <  y  <  Bm} 

u 

{|a  +  Aj  «  Izjy,  bm  *  y  *  Bm> 

u 

{-  A  -z  y$a<A  -  z  y,  B  $  y  <  B  /  |  z„-z.  |  } 
m  1  m  1  m  m  '  2  1 ' 


VGmA{Gm°g(^)}) 

W 


2A  £n  z„-z, 
m  2  1 

$  - 


+  4|z..|(B  -b  )  +  2A  j&n|z0 
_ i  mm _ m _ £ 

2  A  £n  ( B  /  b  ) 
m  mm 


Hence 
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which  goes  to  zero  provided  B^/£n  =  ° (Am) . 


m 


m 


Thus  (B.5)  holds,  which  completes  the  proof  of  the  proposition.  D 


The  author  is  grateful  to  James  Berger,  Jack  Kiefer,  Herman  Rubin, 
and  the  referees  for  their  helpful  suggestions  and  to  the  Stanford 
University  Department  of  Statistics  for  its  hospitality  during  the 
preparation  of  this  paper. 
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2.  Asymptotic  Form  of  Minimax  Rule  in  Quantile  Problem 


lim  n(l-t)c 


[ntl 


(n) 


.288 

.0041 

.289 

.0127 

.29 

.0212 

.30 

.1035 

.40 

.7199 

.50 

1.1128 

.60 

1.3894 

.70 

1.5967 

.80 

1.7593 

.90 

1.8908 

.95 

1.9478 

.99 

1.9899 

.995 

1.9950 

.999 

1.9990 
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1.  Solution  to  Quantile  Problem  for  n  =  3,  4,  5,  and  6 


2 

n 

_4 

2 

2 

c5(n) 

— 

— 

1.0000 

.5000 

ip5(n) 

— 

— 

— 

.5000 

e4(n) 

— 

1.0000 

.5000 

.3056 

^4(n) 

— 

— 

.5000 

.3833 

c3(n) 

1.0000 

.5000 

.2750 

.1470 

^3(n) 

— 

.5000 

.3875 

.3235 

c2(n) 

.5000 

.1875 

.0609 

-.1084 

V10  '  bn-2 

.5000 

.3958 

.3354 

.2972 

m 

n 

.4167 

.3646 

.3344 

.2972 
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